BamHI 10 20 30 40 

'GGATCC'C AAG GTG GAA CAG TTA TCT CCA GAA GAA GAA GAG AAA AGG AGA 
P D K V E Q> 

LSPEEEEKRR> 

1783 TO > 

2470_2466 TO 2573 OF HUM FOS _0 b : 

50 60 70 80 90 

* * * * * * * * * * 

ATC CGA AGG GAA AGG AAT AAG ATG GCT GCA GCC AAA TGC CGC AAC CGG 

IRRERNKMAAAKCRNR> 

2500_b_b_251_2466 TO 2573 OF HUM FOS 30 b b 2540b : 

100 110 120 130 140 



AGG AGG GAG CTG ACT GAT ACA CTC CAA GCG GAG ACA GAC CAA CTA GAA 
RRELTDTLQA> 

E T D Q L E> 

b_2_2466 TO 2573 OF HUM FOS _570b > 

26_2688 TO 3329 OF H : 

150 160 170 180 190 

* ** * ** * ** 

GAT GAG AAG TCT GCT TTG CAG ACC GAG ATT GCC AAC CTG CTG AAG GAG 
DEKSALQTEIANLLKE> 
2710_c c 272_2688 TO 3329 OF HUM FOS _40_c c 2750c : 

200 210 220 230 240 

********** 

AAG GAA AAA CTA GAG TTC ATC CTG GCA GCT CAC CGA CCT GCC TGC AAG 
KEKLEFILAAHRPACK> 
c_2760 c c 2688 TO 3329 OF HUM FOS 2790 c c 2800_; 



ATC CCT GAT TAATTCAAGC TT 
I P> 

FIG. 20 



FIG. 22 



>NdeI >BamHI 

I I 

| 10 20 30 40 | 50 

* | * * * * * * * | * * 

ATATACAT ATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT CGG GAT CCT 
MASMTGGQQMGR> 
a a a a a_Fl 0 a a a a a > 

D P> 
b > 

60 70 80 90 

* * * * * * * * ★ 

GAC CTG GAA CAA CGT GCT GAG GAA CTG GCC CGT GAA AAC GAA GAG CTG 
DLEQRAEELARENEEL> 
b b b b 4 HEPTAD ACIDIC EXTENSION b b b b > 

>XhoI 
I 

100 110 120 130 140 

* * * * * * * * | * * 

GAA AAA GAG GCC GAA GAG CTG GAG CAG GAA AAC GCT GAA CTC GAG GCG 
EKEAEELEQENAELE> 
b b b b_4 HEPTAD ACIDIC EXTENSION_b b b b > 

A> 



150 160 170 180 190 

* * * * * * ** .* 

GAG ACA GAC CAA CTA GAA GAT GAG AAG TCT GCT TTG CAG ACC GAG ATT 
ETDQLEDEKSALQTEI> 
c c c c c C-FOS PROTEIN c C C c c c > 

200 210 220 230 240 

# * * * * * * * * * 

GCC AAC CTG CTG AAG GAG AAG GAA AAA CTA GAG TTC ATC CTG GCA GCT 
ANLLKEKEKLEFILAA> 
c c c c c C - FOS PROTEIN c c c c c c > 

>HindIII 
I 

250 260 270 | 280 

* * * * * * * | * 

CAC CGA CCT GCC TGC AAG ATC CCT GAT TAATTCAAGCTT 
HRPACKIP> 
c C-FOS PROTEIN c c > 



900 NcoI NdeI 

CC ATG GAC TAC AAG GAC GAC GAT GAC AAG CAT ATG GCT AGC ATG ACT GGT 
MDYKDDDDKHMASMTG> 
MDYKDDDDKHMASMTG> 



Flag 960 BamHI 010 



GGA CAG CAA ATG GGT CGG GAT CCT GAC CTG GAA CAA CGT GCT GAG GAA 
GQQMGRDPDLEQRAEE> 
GQQMGRDPDLEQRAEE> 



1020 4 heptad 

* 

CTG GCC CGT GAA AAC GAA GAG CTG GAA AAA GAG GCC GAA GAG CTG GAG 
LARENEELEKEAEELE> 
LARENEELEKEAEELE> 



XhoI 



1080 



CAG GAA AAC GCT GAA CTC GAG GCG GAG ACA GAC CAA CTA GAA GAT GAG 
QENAELEAETDQLEDE> 
QENAELEAETDQLEDE> 



AAG TCT GCT TTG CAG ACC GAG ATT GCC AAC CTG CTG AAG GAG AAG GAA 
KSALQTEIANLLKEKE> 
KSALQTEIANLLKEKE> 



1140 Fos Leucine zipper 

* 

AAA CTA GAG TTC ATC CTG GCA GCT CAC CGA CCT GCC TGC AAG ATC CCT 
KLEFILAAHRPACKIP> 
KLEFILAAHRPACKIP> 



GAT TAA GCTT 

HindIII 



FIG. 23 
>NdeI >BamHI 

I I 
4090 4100 4110 4120 | 4130 

* | * * * * * * * * * 

ATACAT ATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT CGG GAT CCC GAC 
MASMTGGQQMGR> 
a a a a a_Fl 0 a a a a a > 

D P D> 



>XhoI 
I 

4140 4150 4160 4170 4180 

GAA GAG GAA GAT GAC GAA GAA GAA CTC GAG GAA CTG GAA GAC AGC TTT 

EEEDDEEELEELE> 
b b b b b POLY - GLU b b b b b > 

D S F> 



CAC AGT TTG CGG GAC TCA GTC CCA TCA CTC CAA GGA GAG AAG GCA TCC 
HSLRDSVPSLQGEKAS> 
c c c c c MAX BHLH DOMAIN_c c c c c C > 



CGG GCC CAA ATC CTA GAC AAA GCA ACA GAG TAT ATC CAG TAT ATG CGA 
RAQILDKATEYIQYMR> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 



AGG AAA AAC CAT ACG CAC CAG CAA GAC ATT GAT GAC CTC AAG CGG CAG 
RKNHTHQQDIDDLKRQ> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 



AAT GCT CTT CTG GAG CAA CAA GTC CGT GCA CTG GAG AAG GCA AGA TCA 
NALLEQQVRALEKARS> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

>HindIII 
I 

4380 4390 4400 

* * * * * | * * 

AGT GCC CAA CTG CAG ACC TGAGGCAA GCTTATC 
S A Q L Q T> 
; BHLH DOMAIN_c > 



FIG. 26 



>NdeI >BamHI 

i i 

4090 4100 4110 4120 | 4130 
* | * * * + * * ** * 
ATATACAT ATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT CGG GAT CCT 
MASMTGGQQMGR> 
a a a a a_Fl 0 a a a a a > 

D P> 
b > 

>XhoI 
I 

4140 4150 4160 4170 | 

GAC CTG GAA AAA GAG GCC GAA GAG CTG GAG CAG GAA AAC GCT GAA CTC 
DLEKEAEELEQENAEL> 
b b TWO AMPHIPATHIC HEPTADS (1ST PHASE-783) b b > 

4180 4190 4200 4210 4220 

********** 
GAG CTG GAA GAC AGC TTT CAC AGT TTG CGG GAC TCA GTC CCA TCA CTC 

E L E> 
b b > 

DSFHSLRDSVPSL> 
c c c c_MAX BHLH DOMAIN C c c c > 

4230 4240 4250 4260 4270 

********** 
CAA GGA GAG AAG GCA TCC CGG GCC CAA ATC CTA GAC AAA GCA ACA GAG 
QGEKASRAQILDKATE> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

4280 4290 4300 4310 4320 
********* 
TAT ATC CAG TAT ATG CGA AGG AAA AAC CAT ACG CAC CAG CAA GAC ATT 
YIQYMRRKNHTHQQDI> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

4330 4340 4350 4360 4370 

* 

GAT GAC CTC AAG CGG CAG AAT GCT CTT CTG GAG CAA CAA GTC CGT GCA 
DDLKRQNALLEQQVRA> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

>HindIII 
I 

4380 4390 4400 4410 | 4420 

** * ** * * * j * * 

CTG GAG AAG GCA AGA TCA AGT GCC CAA CTG CAG ACC TGA GGCAAGCTTA 

LEKARS S AQLQ T> 
c c c MAX BHLH DOMAIN_c C C C > 

TC 



FIG. 27 

>NdeI >BamHI 
| I 

4090 4100 4110 4120 | 4130 

ATATACAT ATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT CGG GAT CCT 
MASMTGGQQMGR> 
a a a a a_Fl 0 a a a a a > 

D P> 
b > 

>XhoI 

I 

4140 4150 4160 4170 

* I 

GAC CTG GAA AAA GAG GCC GAA GAG CTG GAG CAG GAA AAC GCT GAA CTC 
DLEKEAEELEQENAEL> 
b b TWO AMPHIPATHIC HEPTADS (2ND PHASE-784) b b > 

4180 4190 4200 4210 4220 

GAG GAA CTG GAA GAC AGC TTT CAC AGT TTG CGG GAC TCA GTC CCA TCA 
EELE> 
TWO AMPH > 

DSFHSLRDSVPS> 
c c c MAX BHLH DOMAIN_c c c c > 

4230 4240 4250 4260 4270 

* * ** * ** * * * 
CTC CAA GGA GAG AAG GCA TCC CGG GCC CAA ATC CTA GAC AAA GCA ACA 
LQGEKASRAQILDKAT> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

4280 4290 4300 4310 4320 
********* 
GAG TAT ATC CAG TAT ATG CGA AGG AAA AAC CAT ACG CAC CAG CAA GAC 
EYIQYMRRKNHTHQQD> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

4330 4340 4350 4360 4370 

ATT GAT GAC CTC AAG CGG CAG AAT GCT CTT CTG GAG CAA CAA GTC CGT 
IDDLKRQNALLEQQVR> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 

>HindIII 
I 

4380 4390 4400 4410 4420 

** * ** * ***|* 

GCA CTG GAG AAG GCA AGA TCA AGT GCC CAA CTG CAG ACC TGAGGCAAGC 

ALEKARS S A Q L Q T> 
c c c c_MAX BHLH DOMAIN c c c c > 



TTATC 
>NdeI >BamHI 
I I 

4090 4100 4110 4120 | 4130 

ATATACAT ATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT CGG GAT CCT 
MASMTGGQQMGR> 
a a a a a_F 1 0 a a a a a > 

D P> 

b > 

>XhoI 

I 

4140 4150 4160 4170 | 

GAC CTG GAA AAA GAG GCC GAA GAG CTG GAG CAG GAA AAC GCT GAA CTC 
DLEKEAEELEQENAEL> 
b b TWO AMPHIPATHIC HEPTADS (3D PHASE-785) b b > 

4180 4190 4200 4210 4220 

GAG GAA GAG CTG GAA GAC AGC TTT CAC AGT TTG CGG GAC TCA GTC CCA 
EEELE> 
TWO AMPHIPAT > 

DSFHSLRDSVP> 
c c c_MAX BHLH DOMAIN c c c > 

4230 4240 4250 4260 4270 

********** 

TCA CTC CAA GGA GAG AAG GCA TCC CGG GCC CAA ATC CTA GAC AAA GCA 
SLQGEKASRAQILDKA> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 



ACA GAG TAT ATC CAG TAT ATG CGA AGG AAA AAC CAT ACG CAC CAG CAA 
TEYIQYMRRKNHTHQQ> 
c c c c c MAX BHLH DOMAIN_c C c c c c > 



GAC ATT GAT GAC CTC AAG CGG CAG AAT GCT CTT CTG GAG CAA CAA GTC 
DIDDLKRQNALLEQQV> 
c c c c c MAX BHLH DOMAIN_c c c c c c > 



>HindIII 
I 

4420 



CGT GCA CTG GAG AAG GCA AGA TCA AGT GCC CAA CTG CAG ACC TGAGGCA 

RALEKARSSAQLQ T> 
c c c c MAX BHLH DOMAIN_c c c c c > 



AGCTTATC 



BamHI 10 20 30 40 

** * ** * ** * 

GGATCCC AAC GAC AAG AGG CGG ACA CAC AAC GTC TTG GAA CGT CAG AGG 
NDKRRTHNVLERQR> 
PUTATIVE; NCBI GI : 50468; CODON_START=1 ; C-MYC : 

50 60 70 80 90 

* * * * ★ * * * * * 

AGG AAC GAG CTG AAG CGC AGC TTT TTT GCC CTG CGT GAC CAG ATC CCT 
RNELKRSFFALRDQ I P> 
PUTATIVE; NCBI GI : 50468; CODON_START= 1 ; C-MYC PROTEIN; : 

100 110 120 130 140 

* * * * * ** * * * 

GAA TTG GAA AAC AAC GAA AAG GCC CCC AAG GTA GTG ATC CTC AAA AAA 
ELENNEKAPKVVILKK> 
PUTATIVE; NCBI GI : 50468; CODON_START=1 ; C-MYC PROTEIN; : 

150 160 170 180 190 

* * * * * * * ** 

GCC ACC GCC TAC ATC CTG TCC ATT CAA GCA GAC GAG CAC AAG CTC ACC 
ATAYILS IQADEHKLT> 
PUTATIVE; NCBI GI : 50468; CODON_START= 1 ; C-MYC PROTEIN; : 

200 210 220 230 240 

* ** * ** * * * ★ 

TCT GAA AAG GAC TTA TTG AGG AAA CGA CGA GAA CAG TTG AAA CAC AAA 
SEKDLLRKRREQLKHK> 
PUTATIVE; NCBI GI : 50468; CODON_START=1 ; C-MYC PROTEIN; : 

250 260 270 

* * * * * * * 

CTC GAA CAG CTT CGA AAC TCT GGT GCA TAA AAGCTT 



LEQLRNSGA 
PUTATIVE; NCBI GI : 50468; CODON_ 



HindIII 